Zelin Zhao

ERNIE 5.0 Technical Report

Feb 04, 2026

Clarify Before You Draw: Proactive Agents for Robust Text-to-CAD Generation

Feb 03, 2026

MV-S2V: Multi-View Subject-Consistent Video Generation

Jan 27, 2026

PhyAVBench: A Challenging Audio Physics-Sensitivity Benchmark for Physically Grounded Text-to-Audio-Video Generation

Dec 30, 2025

ByteLoom: Weaving Geometry-Consistent Human-Object Interactions through Progressive Curriculum Learning

Dec 28, 2025

CETCAM: Camera-Controllable Video Generation via Consistent and Extensible Tokenization

Dec 22, 2025

Grounding and Enhancing Grid-based Models for Neural Fields

Apr 06, 2024

CodeFuse-13B: A Pretrained Multi-lingual Code Large Language Model

Oct 10, 2023

End-to-end View Synthesis via NeRF Attention

Aug 01, 2022

Tracking Objects as Pixel-wise Distributions

Jul 15, 2022